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ROTATION  DESIGNS  FOR  SAMPLING  ON  REPEATED  OCCASIONS* 


J.  N.  K.  Rao  and  Jack  E.  Graham 
Iowa  State  University 

i.  raraorwcnoN 

A 

There  are  many  studies,  notably  in  sociological  and  economic  research, 

*  which  are  concerned  with  estimating  characteristics  of  a  population  on 

’repeated  occasions'  in  order  to  measure  time-trends  as  well  as  the  current 

values  of  characteristics  as  a  time  series.  For  example,  the  Current 

Population  Survey  (C.P.S.)  of  the  Bureau  of  the  Census  is,  among  other 

things,  concerned  with  estimating  monthly  and  yearly  changes  in  the  level 

of  unemployment .  Estimation  problems  in  time  series  analysis  have,  of 

course,  received  considerable  attention  for  seme  time  but  most  of  these 

studies  are  devoted  to  the  estimation  of  parameters  in  time  series  models 

usually  involving  the  concepts  of  infinite  populations.  On  the  other  hand, 

most  of  the  distribution  free  theory  of  sampling  finite  populations  is 

concerned  with  estimates  applying  to  a  population  at  one  particular  time, 

and  applies  to  so-called  ’one  shot  surveys'.  In  this  study  we  attempt  a 

combination  of  finite  population  sampling  with  time  series  analysis.  This 

activity,  sometimes  referred  to  as  'sampling  on  repeated  occasions',  has 

already  received  same  attention  although  sane  of  the  references  we  discuss 

*  below  are  only  concerned  with  infinite  population  models. 

i  When  the  same  population  is  sampled  repeatedly,  the  opportunities  for 

a  flexible  sample  design  are  increased.  For  example,  on  the  h  occasion 

we  may  have  parts  of  the  sample  that  are  matched  with  the  (h-l)th  occasion, 

parts  that  are  matched  with  both  the  (h-l)th  and  the  (h-2)th  occasions, 
*Research  sponsored  by  Army  Research  Office,  Durham,  N.  C. ,  under 

Grant  No.  DA-AR0(D)-31-124-G93. 
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and  soon.  Such  a  method  of  partial  matching  has  "been  termed  "sampling 
on  successive  occasions  with  partial  replacement  of  units"  {[5]  ),  "rotation 
sampling"  ( [2] ),  and  "sampling  for  a  time  series"  ([>]).  The  optimum  re¬ 
placement  policy  for  two  occasions  has  been  investigated  by  Jessen  [  4  ]  and 
the  general  problem  of  replacement  for  more  than  two  occasions  has  been 
examined  by  Yates  [7]>  Patterson  [5],  Eekler  [2],  and  several  others. 

However,  the  theory  has  been  almost  exclusively  confined  to  infinite 
populations.  A  good  summary  of  these  papers  is  given  by  Cochran  [l]. 

In  the  Current  Population  Survey  conducted  monthly  by  the  U.  S. 

Bureau  of  the  Census  (Hansen  et  al.  [3]  ),  a  rotation  sampling  design  is 
Imposed  within  each  primary  mainly  for  the  purpose  of  reducing  response 
resistance  (which  may  occur  if  the  same  panel  of  households  is  interviewed 
indefinitely),  and  to  reduce  the  within  primary  component  of  variance  of  "the 
estimates  under  certain  circumstances.  The  rotation  pattern  is  as  follows: 
eight  systematic  sub-samples  (rotation  groups)  of  segments  are  identified 
for  each  sample.  A  given  rotation  group  stays  in  the  sample  for  four 
consecutive  months,  leaves  the  sample  during  the  eight  succeeding  months, 
and  then  returns  for  another  four  consecutive  months.  It  is  then  dropped 
from  the  sample.  Under  this  system  of  sampling,  75  per  cent  of  the  sample 
segaents  are  common  between  consecutive  months  and  50  per  cent  are  common 
between  the  Bame  months  of  two  consecutive  years. 

The  composite  estimator  of  the  population  mean,  V  of  the  current 
month  (denoted  by  0)  in  this  survey  is 

*0  "  Q(*ll  +  *0,-1  '  *-l,0>  +  (1  "  *0 

where  Q  Is  a  constant  weight  factor  with  0i.Q  <•  1,  xQ  is  the  estimator 
based  on  the  entire  sample  for  the  current  month,  0,  Xq 


is  the  estimator* 
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for  the  current  month  but  based  on  the  sample  segments  common  to  both  months 
0  and  -1,  x  ,  »  is  the  estimator  for  the  previous  month,  -1,  but  based  on 
the  sample  segments  cannon  to  both  months  0  and  -1,  and  x'^  is  the  composite 
estimator  for  the  previous  month,  -1.  The  composite  estimator  of  the  change, 


X0  -  Xml,  is 


d0 


(2) 


The  canposite  estimators  take  advantage  of  the  information  obtained  oh 
previous  occasions  as  veil  as  the  infoxmation  from  the  current  occasion, 
and  results  in  smaller  variances  for  both  the  current  estimate  and  the 
estimate  of  change  for  most  of  the  characteristics ;  the  larger  gains  are 
usually  achieved  for  the  estimate  of  change .  Hansen  et  al.  [3]  have  given 
the  variance  of  x^  and  d^,  for  their  particular  rotation  pattern,  under 
same  simplifying  assumptions  and  investigated  the  reduction  in  the  vithin 
primary  component  of  the  variance  of  and  d*  for  some  alternative  assumed 
correlations  between  months  and  for  selected  values  of  Q. 

The  purpose  of  the  present  study  is  to  develop  a  unified  finite  pop¬ 
ulation  theory  for  the  composite  estimators  and  d^.  We  formulate  a 
general  rotation  pattern  and  obtain  the  variance  of  the  canposite  esti¬ 
mators  explicitly.  We  also  obtain  the  optimum  values  of  Q  under  certain 
simplifying  assumptions  regarding  the  correlation  pattern  from  occasion  to 
occasion.  The  variance  for  the  particular  rotation  pattern  used  by 
Hansen  et  al.  can  be  obtained  as  a  special  case  from  the  general  results. 

Onate  [6],  in  developing  multistage  designs  for  the  Philippine  Sta¬ 
tistical  Survey  of  Households,  divided  each  sample  barrio  (a  second  stage 
unit  that  corresponds  to  a  township  in  the  United  States)  into  a  small 
number  of  segments  (less  than  10)  and  specified  a  particular  rotation 
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pattern  for  the  segments.  This  rotation  pattern  vas  mainly  intended  to 
reduce  the  response  resistance  of  panel  households,  for  it  was  observed 
that  the  households  tended  to  be cane  uncooperative  during  the  third  or 
fourth  visit.  Moreover,  Onate  developed  a  finite  population  theory  for 
the  composite  estimator  of  the  barrio  total  for  his  special  rotation 
pattern  (which  is  a  special  case  of  the  general  rotation  pattern).  We 
follow  Onate' s  approach  in  developing  a  finite  population  theory  for  the 
general  rotation  pattern.  We  give  here  the  results  for  single-stage 
designs,  but  the  extension  to  multistage  designs,  etc.,  is  relatively 
straightforward  and  therefore  is  not  presented  here. 

2.  THE  GENERAL  ROTATION  PATTERN 

We  assume  that  the  actual  units  in  the  population  remain  unchanged  in 
time,  and  denote  by  N  and  n  the  population  and  sample  sizes,  respectively 

o 

(which  are  the  seme  for  all  occasions).  Also,  let  N  and  n  be  multiples  of 
ng(>_  l).  Then  the  rotation  pattern  is  as  follows:  A  group  of  ng  units 
stays  in  the  sample  for  r  occasions  (n  =  ngr),  leaves  the  sample  for  m 
occasions,  comes  back  into  the  sample  for  another  r  occasion,  then  leaves 
the  sample  for.  m  occasions,  and  so  on.  If  a  unit  returns  to  the  sample 
after  having  dropped  out  (k  -  l)  previous  times  from  the  sample,  we  say 
the  unit  is  in  the  k  cycle.  We  only  consider  the  case  m  >  r  here,  for 
case  m  <  r  is  more  complicated  and  less  viseful  in  practice.  The  maximum 
value  of  m  is  r(^  -  l)  and,  if  m  is  less  than  the  maximum  value,  it 
amounts  to  covering  only  a  fraction,  viz.,  n^m  +  n  of  the  N  units  in  the 
rotating  design. 

Til  i  attraction  -  Let  N  ■  5,  n  ■  r  ■  2  and  m  ■  3.  Figure  1  illustrates 


5 


the 

the  rotation  pattern  for  a  particular  numbering  of/5  units  in  the  population. 
There  are  5 J  possible  permutations  corresponding  to  the  5!  ways  of  numbering 
the  units  in  the  population.  The  numbers  1  to  5  are  assigned  to  the  5  units 
in  the  population.  These  permutations  provide  the  stochastic  input  into 
the  sampling  design  since  it  amounts  to  saying  that  the  sample  consists  of 
one  permutation  selected  at  randan  fran  the  finite  population  of  5* 
permutations . 


Figure  1. 

Rotation  pattern  with 

N  -  5, 

n  *  r  -  2  and  m  =  3« 
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In  general,  the  finite  population  is  composed  of  the  NJ  possible 
permutations  and  the  sample  consists  of  one  randomly  selected  permutation 
frarn  this  population. 

3.  THE  COMPOSITE  ESTIMATORS  x'Q  and  d» 

Let  x  ,  denote  the  value  of  the  characteristics  for  the  unit  on 

Qy 

the  a  occasion,  (a  >  0,  -1,  ...,  -u;  j  «*  1,  ...,  N),  where  -u  denotes 
the  occasion  at  which  sampling  first  takes  place.  We  assume  that  u  is 
large  in  order  to  simplify  the  derivation  of  the  variance  formulae. 

The  ccmpoaite  estimator  of  the  current  occasion  population  mean,  XQ,  is 
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Q(i 


+  x, 


0,-1  ‘  x-l,0 


,)  +  (1  -  Q)  xr 


(3) 


■where 


J  xo,/>  ' 


x,  n  -  2^  x-  /n. ,  x.  ,  ,,  »  I?”  x  .  ,/ru,  x 

a,o-l  ^  a,,)'  l'  cir l,a  ^  a-l,j'  jL  a 

x’^  is  the  composite  estimator  for  occasion  -1,  and  0  <  Q  <1. 

Here  n^  ■  n^(r  -  l)  is  the  number  of  units  matched  between  occasions  a  -  1 
and  a.  The  composite  estimator  of  the  population  change,  XQ  -  X  ^  is 


-  x* 


-1  * 


Now  x*  can  be  written  as 


-u 


x0  -  2nQ““Wa 

Q>=  0 


-u  N 

^  ^  ^ cc  k  xa  k 

» iO  k-1  a,k  a,k 


(4) 


(5) 


where 


Wa  -  Q<xa,a-l  "  ^i,a>  +  &  “  xa 


(6) 


for  a  =  0,  -1,  -  (u  -  l),  W  =  x  and  the  w^  ,  are  functions  of  Q, 

7  -u  -u  a,k 

r  and  ng.  Fran  (5)  and  (6)  it  may  be  verified  that  the  weights  wQ  are  as 
follows: 

For  the  current  occasion,  a  ■=  0, 

(a)  wQ  ^  *  (l  -  Q)/n  for  ng  units  (first  visit  of  a  cycle) 


(b)  vn  ,  ■  (l  -  Q)/n  +  0/n.  for  n_  units  (second  to  rth 

’  visit  of  a  cycle) 


(7) 


(c)  v0>k  -  0 


for  N  -  n  units  (not  in  the  sample) 
For  occasions  a  =*  -1,  ...,  -  (u  -  l), 


(a)  v  .  S&  .-Jfl  _  301  for  np  units  (first  visit  of  a  cycle) 

\Xy  A  n  IL  t 


-a 


Ob)  w, 


a,k 


<L%LsJ3l 


—  for  n1  -  ng  units  (second 
x  ^  to  (r-l)th  visit  of  a  cycle)  (8) 


(c)  va,k  "  ^  + 

(d)  %k  "  0 


-aH 


— —  for  n„  units  (rtla  visit  of  a  cycle) 
nl 


for  N -  n  units  (not  in  the  sample) 
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For  occasion  -u,  w  .  =  QU(l/n  -  l/n.,)  for  n..  units  (first  to  (r-l)th  visit), 

m‘ u  f  it  XX 

w  .  =  Q  /n  for  nc  units  (r  visit),  and  v  ,  =  0  for  N-n  units  (not  in 
the  sample).  Therefore,  the  averages  of  the  weights  over  the  N!  possible 
permutations  ore 


) 


1_ 

N! 


I  ^  -  1>I  v0,k 


1 

N 


2  v. 


0,k 


1 

N 


and 


E<%*>  ■  W  E  <»  -  D!  \k 

Hence,  is  an  unbiased  estimator  of  XQ. 


estimator  of  X„ 


v-l* 


I  l  wa,k  =  °'  a  <  °- 

Similarly,  d^  is  an  unbiased 


4.  VARIANCE  OF  x'Q  AND  OPTIMUM  Q 

4.1  Variance  of 

Since  u  is  assumed  to  be  large,  the  variance  obtained  by  replacing  -u 
by  -oo  will  be  virtually  the  same  as  the  true  variance.  Therefore,  the 
variance  of  x^,  V(x*j),  is 
V(xq)  =  E(x£2)  -  X* 


4  2  2  E<v“’k)  +  XX  E<v“’* Vk>  Vk 

=0 

-oo  n 

+  »0  X  E<v“’*  V*'  ’  V* 

=*1 


-0D  N 

c^n'  k/k' 


N 


N 


a,k  a'k' 

ca’k* 

(9) 


xa,k  xafk‘ 


Now,  since  E  wn  ,  =  1  and  2  w .  =  0,  a  <  0, 

E^W0,k  w0,k'^ ""  n(n  -  1)  ■  E^w0,k^ 

E(wa,k  wa,k' }  =  -  FTT  E(va,k}'  a  <  0 


(10)  , 


and 


"a,k  "a'k*'  “  '  N  -  1  E^va,k  wa*k^;  a+  a'  m  °>  -!>•••» 
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Using  (10)  in  (9),  we  have 

vfiTj)  -  S*  [  I  E(»^k)  -  $  ]  ♦  »  4 


♦  »  2  E,Vk  Vi1  sa,a* 


o^a'=0 


(11) 

is  the  mean 


p  "til 

where  S  is  the  mean  square  for  the  a  occasion  and  S  , 

CC  '**f'~* 

product  between  occasions  ot  and  qj*  •  Note  that  the  variance  formula  (ll)  is 

N 

quite  general  and  is  applicable  for  an£  set  of  weights  va  k  such  that  2  w^k  «  0, 
N  1 

a  <  0,  and  Zw.  .  =1. 

1  0,k 

For  the  general  rotation  pattern  the  weights  wQ^k  are  given  by  (7)  and 

the  weights  w  (a  <  0)  are  given  by  (8).  Using  these  weights,  we  evaluate 

a,  it 

E(w^k)  and  E(wa?k  w^).  Now 


(12) 


and 


*  *(v>  - 2  4,k 


n„ 


.-2a 


(Q2  +  2  Q  +  1),  a  <  0. 


Noting  that  N  E(w^k  v^)  °  2  w^k  wa,k,  and  using  the  general  rotation 

pattern  and  the  weights  in  (7)  and  (8),  we  obtain,  after  simplification, 

the  expectations  E(w^  k  w^)  given  in  Appendix  A.  For  example,  since  n1 

units  are  cctmnon  between  occasions  0  and  -s(r  +  m)  -  1,  s  ■  0,  . •♦,  oo, 

N 

N  E  ^W0,k  w-s(r+m)-l,k^  =  E  w0,k  w-s(r+m)-l,k 

-  «»!«*>*![  .  A,  (1^2  ..  J) 


*  <“i  -  ^ 


%  (1  a)2  eB(r*”)+1  . 

"l 


“2 

n 
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Substitution  in  (ll)  of  the  expectations  (12)  and  those  of  Appendix  A  yields 

the  variance  formula  in  Appendix  B„  It  is  interesting  to  note  from  Appendix  B 

that  the  effect  of  the  finite  population  of  size  N  on  V(x^)  is  simply  to 

subtract  ~S^/N  from  the  variance  of  x^  derived  by  assuming  N  infinite. 

The  variance  formula  (B-l)  is  very  unwieldy  and,  therefore,  we  consider 
2 

two  models  for  S  and  S  ,  which  simplify  the  variance  formula  and  enable 

Ct  Q»*  u 

the 

the  evaluation  of/ optimum  Q. 


5 a  "  S0>  Sa,afS  =  S0,3  8114  S0,p 


o2 

P  s0 


Model  1.  We  first  consider  an  exponential  correlation  pattern.  The  model  is 

(13) 

QC,  P  —  “1,  “*2,  .  .  •  ,  “CO . 

This  model  has  been  vised  by  Patterson  [5]  and  others,  and  appears  to  be  a 
satisfactory  assumption  in  many  practical  situations.  Equation  (13)  is  a 
special  case  of  a  more  general  model  =  SQ  pa  with  p  =  1  +  £  ,  where  £  is 
a  small  positive  number.  Our  calculations,  however,  have  shown  that  the 
variance  with  %  =  0  is  practically  the  same  as  the  variance  with  a  small  S 
(say  0.03).  Therefore,  we  confine  our  study  here  to  the  simpler  model,  (13). 
With  this  model,  the  variance  of  x£>,  given  in  Appendix  B,  reduces  to 


v(x’ )  -  (r  -  i)  s®  + 


'o'  "  'h  -  r  “0  T  jT27I  Ts 


2n|<lS^ 

T 


ncn£(l  -  Qc)(l  -  Qp)c  1  -  (Op)™ 

Q2  [  rp2  -  (r2  +  l)p  +  r]  +  Q  [  r(r  -  l)  p2  -  2(r  -  l)  p  +  r(r  -  l)J 
-  (r  -  l)2  p  +  Qr  p*""1  (  Q2  [  -  (r  -  l)  p2  +  r(r  -  l)  pj 


+  Q  [  -  (r2  -2r  +  2)p2  +  2rp-  r2J  -  (r  -  l)  p2  +  r(r  -  l)  p  ) 


+  Q01  pm+4C  [”  r2p2  +  r(r  +  l)  p  -  r  ]  +  Q2  [  2r(r  -  l)  p  -  (r-l)(r+l)] 
+  Q  [-  r(r  -  l)  p  -  (r  -  l)(r  -  2)]  +  (r  -  l)2  )  +  Qm+r  pm+rC  Q5  [  r(r  -  l)p2 
-  r(r  -  l)  p  ]  +  Q2 [  r  p2  -  (2r2  -  r  +  l)  p  +  r2}  +  Q  f(r  -  l)(r  -  2)  p 


+  r(r  -  1)]  +  (r  -  1)  p  -  r(r  -  l)}} 


(14) 
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For  moderate  and  large  values  of  m  (which  Implies  that  N  is  large),  the 
following  formula  for  V(x^),  obtained  fran  (l4)  by  substituting  m  =  oo 

p 

and  ignoring  the  finite  population  correction  term  (i.e.,  the  term  -Sq/n), 
is  a  good  approximation: 


“S  {  Q2  Cr  p2  -  (r2  +  1)  P  +  r] 


S0  *4*4 

^ '  ~  -  <Au  -  *>■ 

+  Q  Cr(r  -  l)  p2  -  2(r  -  l)  p  +  r(r  -  l)}  -  (r  -  l)2  p  +  Qrpr"1 . 


(  Q2  [-  (r  -  l)  p2  +  r(r  -  l)  p  ]  +  Q  [  -  (r2  ■»  2r  +  2)  p2  +  2r  p  -  r2] 
-  (r  -  1)  p2  +  r(r  -  l)  p  }}  .  (15) 


Our  calculations  in  sub-section  4.2  indicate  that  for  moderate  and  large 
values  of  m,  V(x^)  as  obtained  from  (15)  differs  very  little  frcm  that 
obtained  from  (14),  and  the  optimum  values  of  Q  are  unaffected  by  m  in 
most  cases. 

The  special  case  xa>k  »  x^,  a,a*  -  0,  -1,  ...,  -co,  provides  a  check 
on  (l4).-  Then  x£  reduces  to 

-00 

xj  -  (1  -  Q)  E  Q"“x 
c*»0 


and 


v(%) 


(1  -  a)2[ 


-00 

Z 

onO 


Q"2a  V(xa) 


+  2 


-00 

Z 


Cov(x-a,  x.a,)J 


(16) 


Now 


^  -  £  -  i>  so 


Gov(V  ^s(r+m)-t) 


r  n2(r  ■  t) 

— "  n 


Gov(V  *os(r+m)-t^  “  -  T 


and 


V”***  *o..(r4.)-m-t>  -  -  |)  so 


2 

Sq  for  t  <  r 

for  r  <_  t  <  m 

for  1  <  t  <  r. 
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Substituting  these  values  into  (l6),  we  find,  after  simplification. 


W  -  <M>  so  -  -5— k 


[»- 


+  Q‘ 


r-na 


]• 


nc(l  -  Qc)(l  -  Qr4“) 

The  special  case  xQ  ^  ®  xa,k  implies  p  ■  1.  With  p  *  1,  (14) reduces  to 

(17),  thus  providing  a  check. 

The  per  cent  gain  in  efficiency  of  xi  over  x.  is 


vfcQ)  -  V(x^) 
V(x’) 


x  100 


(18) 


where 


v(*0>  -  (;  -  S>  s0  •  (w) 

Model  2.  We  now  consider  an  arithmetic  correlation  pattern  similar  to  that 
of  Hansen  et  al.  [3].  As  mentioned  earlier,  the  empirical  results  with 
Model  1,  given  in  sub-section  4.2,  indicate  that  V(x^)  computed  from  (l4) 
with  m  ■  00  differs  very  little  from  V(x^)  with  moderate  m,  and  the  optimum 
Q  is  unaffected  in  most  cases.  In  order  to  simplify  the  discussion  we, 
therefore,  confine  ourselves  here  to  an  arithmetic  correlation  pattern  and 
to  the  variance  of  x£  with  m  *  00,  The  model  is 

Sa"  S0'  Sa,afp  =  S0,p’  a,P  “  -1'  •••*  "®;  *  “  1»  2>  "•>  r-1  ^ 

S0,a  “  [  P  +  (a  +  l)  1  ]  S2  for  -  (a  +  l)  d  <  p 

=  0  for  -  (a  +  l)  d  >  p 


where  d  is  a  small  positive  number.  Using  (20)  in  the  general  variance 
formula,  given  in  Appendix  B,  with  m  =  00 ,  and  neglecting  the  finite  popula¬ 
tion  correction  term,  we  obtain 

g2  2S*  r 

V(x')  -  ~  + - g-  —  p - ?  j  Q2  [  r(r-l)  -  r(3r-4)Q  +  3r(r-2)  Q2 

0  n  nr(r-l)2(l-Q2)(l-Q)5 

-  r(r-4)Q^  -  r  Q*4']  +  Q  p  [  -  (r-l)2  +  (r-l)(2r-3)  Q  +  (2r-3)  Q2  -  (2r2-4r-l)Q^ 
+  r(r-4)Q^+rQ^  ]  +  Q2d[  (r-l)(r-2)  -  (r-2)2Q  -  (r2-2r-2)Q2  +  r(r-4)Q^  +  rQ^j } 
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2Sn  <4  4  f  2  2  2 

+  1  ■  -a;  1  ■,  ’i  -  r(r-2)  +  (3r  -8t+2)q  -  (3r  -10r+4)Q* 

nr(r-l)(l-Q2)(l-Q)5 

oc, 2  -T+ 1 

+  (r-4r+2)Q3  1  +  — ? - *—  .  (2l) 

nr(l-Q  ) 

The  per  cent  gain  in  efficiency  of  x^  relative  to  xQ  may  be  computed  from 
(l8)  using  (21)  and  (19)  without  the  finite  population  correction. 


4.2  Optimum  Q,  and  per  cent  gain  in  efficiency 

Model  1.  Using  an  IEM  7074  with  a  20k  memory  we  computed  the  per  cent  gain 

in  efficiency  from  (14),  (l8)  and  (19)  for  Q  =  0.l(0.l)0.9,  p  -  0.5(0.l)0.9, 

r  a  2(1)4(2)8  and  m  =  sr  with  a  =  l(l)4,  oo.  In  computing  the  per  cent 

2 

gain  in  efficiency  we  Ignored  the  term  -S./N  in  (l4)  and  (19),  since  it  does 
not  change  the  optimum  value  of  Q,  The  effect  of  ignoring  -Sq/n  is  to 
decrease  the  per  cent  gain  in  efficiency.  The  optimum  Q  for  each  combination 
(r,  p,  m)  is  that  value  of  Q  which  gives  the  maximum  per  cent  gain  in 
efficiency.  The  optimum  Q  given  below  is  correct  to  only  the  first  decimal 
place.  However,  it  is  quite  satisfactory  since  small  deviations  from  the 
true  optimum  lead  only  to  a  very  small  loss  in  efficiency.  Table  1  gives 
the  per  cent  gain  in  efficiency  aid.  the  optimum  Q  for  the  selected  values 
of  r,  m  and  p.  Several  Interesting  results  emerge  from  Table  1.  First, 
the  value  of  V(x^)  for  moderate  K  is  virtually  the  same  as  that  for 
m  =  oo  obtained  from  the  simplified  formula  (15).  Secondly,  the  optimum  Q 
is  unaffected  by  m  (except  for  the  case  r  =  4  and  p  »  0.9  where  the  optimum 
Q  deviates  by  0.1  frcm  m  »  4  to  m  *»  8).  Thirdly,  the  optimum  value  of  r 
is  2.  However,  if  we  are  interested  in  estimating  the  change  simultaneously 
then  r  =  2  may  not  be  optimum.  Moreover,  other  practical  considerations  may 
acmetimes  warrant  the  use  of  an  r  other  than  2.  Finally,  the  gains  in 
efficiency  of  x£  over  x^  are  only  moderate,  even  with  a  fairly  high  p. 


13 


Table  1.  Per  cent  gain  in  efficiency  of  over  xQ,  and  in  perantheses 
optimum  Q 


p 

m 

0.5  . 

0.6 

0.7 

0.8 

0.9 

2 

5. 2(0. 2) 

r  =  2 

8. 5(0. 2) 

14.2(0.3) 

22.8(0.4) 

38.8(0.5) 

4 

5. 3(0.2) 

8, 7(0. 2) 

15.3(0.3) 

26.7(0.4) 

50.6(0.5) 

8 

5. 3(0.2) 

8. 7(0, 2) 

15.3(0.3) 

27.1(0.4) 

55.7(0.6) 

00 

5. 3(0.2) 

8.7(0. 2) 

15.3(0.3) 

27.1(0.4) 

56.2(0.6) 

3 

4. 2(0.2) 

7.2(0. 3) 

12.0(0.4) 

20.2(0.5) 

36.6(0.6) 

6 

4. 2(0.2) 

7.3(0. 3) 

12.4(0.4) 

22.3(0.5) 

45.3(0.6) 

9 

4. 2(0.2) 

7»3(0.3) 

12.4(0.4) 

22.4(0.5) 

46.7(0.6) 

00 

4. 2(0.2) 

7. 3(0. 3) 

12.4(0.4) 

22.4(0.5) 

47.0(0.6) 

4 

3.3(0.2) 

r  ■  4 

5. 8(0.3) 

9, 8(0,4) 

17.1(0.5) 

32.4(0.6) 

8 

3.3(0.2) 

5.8(0. 3) 

9. 9(0.4) 

17.8(0.5) 

38.6(0.7) 

12 

3. 3(0,2) 

5. 8(0.3) 

9. 9(0.4) 

17.8(0.5) 

39.8(0.7) 

00 

3.3(0.2) 

5. 8(0.3) 

9* 9(0.4) 

17.8(0.5) 

40.0(0.7) 

6 

2. 3(0. 3) 

r  ■  6 

4. 0(0, 3) 

6, 7(0.4) 

12.0(0.6) 

26.0(0.7) 

12 

2. 3(0. 3) 

4, 0(0. 3) 

6. 7(0.4) 

12.2(0.6) 

28.3(0.7) 

18 

2. 3(0. 3) 

4.0(0. 3) 

6, 7(0. 4) 

12.2(0.6) 

28.5(0.7) 

CO 

2.3(0.3) 

4. 0(0.3) 

r  =  8 

3. 0(0. 3) 

6. 7(0.4) 

12.2(0.6) 

28.5(0.7) 

8 

1.8(0, 3) 

5. 0(0.4) 

9. 2(0. 6) 

20.6(0.7) 

16 

1.8(0.3) 

3.0(0. 3) 

5. 0(0. 4) 

9. 3(0.6) 

21.3(0.7) 

00 

1.8(0.3) 

3» 0(0, 3) 

5. 0(0. 4) 

9.3(0.6) 

21.3(0.7) 
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Model  2.  From  (2l),  (l8)  and  (19)  we  computed  the  optimum  Q  (using  the 
same  procedure  as  in  Model  l)  for  d  *  0.05(0.05)0.20,  p  =  0.6(0.l)0.9  and 
r  =  3  and  4.  With  r  =  2,  the  per  cent  gain  in  efficiency  and  the  optima 
Q  are  the  same  for  both  models.  Table  2  gives  the  per  cent  gain  in 
efficiency  and  the  optimum  Q  for  the  selected  values  of  d,  p  and  r.  Also, 
the  per  cent  gain  in  efficiency  and  the  optimum  Q  for  Model  1  with  m  »  00 
are  given  for  comparison. 


Table  2.  Per  cent  gain  in  efficiency  of  xj  over  xQ,  and  in 
parentheses  optimum  Q 


p 

d 

0.6 

0.7 

0.8 

0.9 

0.05 

8. 7(0.4) 

15.4(0.5) 

28.7(0.6) 

56.1(0.7) 

0.10 

8. 5(0.4) 

14.4(0.4) 

25.0(0.5) 

46.8(0.6) 

0.15 

8.0(0.5) 

15.5(0.4) 

22.9(0.5) 

41.2(0.6) 

0.20 

7. 6(0,5) 

12.6(0.4) 

20.8(0.5) 

56.1(0.6) 

Model  1 

7. 5(0.5) 

12.4(0.4) 

22.4(0.5) 

47.0(0.6) 

r  =  4 

0.05 

8. 0(0, 4) 

14.5(0.5) 

25.9(0.6) 

49.5(0.7) 

0.10 

7. 1(0.4) 

12.5(0.5) 

21.2(0.6) 

57.0(0.7) 

0.15 

6. 5(0.4) 

10.7(0.4) 

17.9(0.5) 

29.9(0.6) 

0.20 

5. 5(0.4) 

9. 8(0.4) 

15.8(0.5) 

25.0(0.6) 

Model  1 

5. 3(0.5) 

9. 9(0. 4) 

17.8(0.5) 

40.0(0.7) 

Table  2  shows  that  model  1  is  fairly  robust  to  moderate  deviations  as 
shown  by  the  alternate  arithmetic  model  calculations,  and  the  optimal  Q  for 
models  1  and  2  are  either  the  same  or  deviate  only  by  0.1. 
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4.3  Variance  of  x^  for  the  Current  Population  Survey  rotation  pattern 


As  described  in  Section  1,  the  rotation  pattern  for  the  Current  Popula¬ 
tion  Survey  is  as  follows:  eight  rotation  groups  each  of  ng  units  are 
identified  for  each  sample  and  a  rotation  group  stays  in  the  sample  for 
four  consecutive  months,  leaves  the  sample  during  the  eight  succeeding 
months,  returns  for  another  four  consecutive  months  and  then  drops  out. 
Therefore,  the  sample  size  is  n’  ■  8n2  =  2n,  r  =  4  and  m  «  8,  '  On  any 
occasion,  4ng  of  the  units  in  the  ssmple  are  in  the  first  cycle  and  the 
remaining  4n2  units  are  in  the  second  cycle  (since  there  are  only  two 
cycles  for  each  rotation  group).  The  composite  estimator  can  be  written 


as 


-OD  N 
2  2 
0.0  k-1 


va,k  *a,k 


with  variance 


VGJ)  .  [l.  E(r*  >  -  i  ]  S*  *  E  1  B(r* )  8* 

Qb-JL 


-00 

+  N  2  E(v, 

c^a’ 

-0 


a,k  va*k^  Sa,a» 


(22) 


where  v. 


a  k  ■  wa  j^2  and  va  ^  are  the  weights  defined  in  (7)  and  (8),  Also, 

^k*  “  i  E(wa,k}'  E(va,k  W  "  \  E(wa,k  W  far  a’  "  0hl»  c>2’  ^ 


E(va,k  W 


i  E(%k  va»k^  for  *  “  a~9>  <>15;  Q^a’  ”  °>  ml*  •••>  m°°> 


and  all  other  expectations  are  zero.  Therefore,  using  the  expressions  for 
E(va,k  va«k)  and  E(va  k  r^)  the  variance  of  x^  is  easily  obtained  frm  (22). 


5.  VARIANCE  OP  d»  AND  OPTIMUM  Q 

5.1  Variance  of  d^ 

The  composite  estimator  of  the  change,  XQ  -  X  ^,  is 
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with  variance 

V<d£)  .  vG%)  +  V(x:i)  -  2  Cov(x^,  x^). 

The  derivation  of  V(x^)  and  Cav{x^,  x’^)  is  similar  to  that  of  V(x^).  Hence, 
without  derivation,  we  have  given  the  general  formula  for  V(d^)  in  Appendix  C. 
For  Model  1  the  general  formula  reduces  to 


V(«A)  -  2(i  -  i)(l  -  p)  S2  - 


2n|  S2  (1  -  Q) 


n4n^(l  -  Qp)*  (l  -  Q*)  1  -  (Qp) 


r+m 


{_ -  (r-l)2p  +  (2r2-5r+j)  Qp  +  r(r  -  l)  Qp2  +  (2r2-r-3)  Q2p 


-  r(3r-4)  Q2p2  -  r(2r-3)  Q2  -  rQp  +  (3r2-2r+l)  Q^p 


-  r(3r-2)  Q^p2  +  r(r-l)  Q^p^  -  r(r-l)  Q^p2  +  r(r-l)  Q^p5 

+  Qrpr  1[r(r-l)  p  -  (r-l)  p2  -  r2Q  -  r(r-3)  Qp  -  (r2-3r+3)  Qp2 
+  r2Q2  +  r(r-3)  Q2p  +  (r2-3r+3)  Q2p2  -  r(r-l)  Q^p  +  (r-l)  Q^p2  3 
+  Qmpm+1  [  (r-l)2  -  (2r2-5r+3)  Q  -  r(r-l)  Qp  -  3(r-l)  Q2 
+  3r(r-l)  Q2p  +  (r2-r-l)  Q?  -  r(r- 3)  Q^p  -  r2Qpp2  +  rCp 

-  r(r+l)  Q4p  ♦  rW  J  +  Qffi+r  pm+r  [  -r(r-l)  +  (r-l)  p 

+  3r(r-l)  Q  -  3(r-l)  Qp  +  r2Q2  -  (6r2  -  7r  +  3)  Q2p  +  r(2r-l)  Q2p2 

-  r2Q^  -  (r2-2r-l)  Q^p  +  r(4r-5>  Q^p2  -  r(r-l)  Qpp'1  +  r(r-l)  Qpp 

-  r(r-l)  Q4p5J}  .  (23) 


The  special  case  xa  k  =  xa,k  provides  a  check  on  (23),  Then 


-00 


a-  .  (1  -  a)  1  «■“  Ga  -  vx) 

*0  . 

and,  using  the  formulae  for  Covfx^  )  given  in  sub-section  4.1,  we  obtain 
after  simplification 


v(aj) 


d .  Qr .  <f+  (j™)  ' 


(a*) 


17 


With  p  =  1,  (23)  reduces  to  (24),  thus  providing  a  check. 

The  per  cent  gain  in  efficiency  of  d^  over  the  estimator  dQ  based  on 
the  difference  between  the  sample  means  of  two  independent  samples  on 
occasions  0  and  -1  is 


where 


V(dQ)  -  V(d£) 
v(d^ 


100 


(25) 


Tt*0>  ■  2(n  -  5>  S0  •  <*> 

Since  the  results  in  sub-section  4.2  Indicate  that  model  1  is  fairly  robust 
to  moderate., deviations ,  we  did  not  investigate  model  2  for  change  estimation. 


5.2  Optimum  Q  and  per  cent  gain  in  efficiency 

As  in  the  case,  of  the  current  estimate,  we  ignored  the  term  involving 
l/N  in  (23)  and  (26)  and  computed  the  per  cent  gain  in  efficiency  from  (25) 
for  Q  =  0.1(0. 1)0. 9,  p  =  0.5(0. 1)0. 9,  r  «  2(l)4(2)8  and  m  =  sr  with  s  = 
l(l)4,  oc.  The  optimum  Q  for  each  combination  (r,  p,  m)  is  that  value  of 
Q  which  maximizes  the  per  cent  gain  in  efficiency.  Table  3  gives  the  per 
cent  gain  in  efficiency  of  over  dy  and  the  optimum  Q  for  d^.  The  per  cent 
gain  in  efficiency  of  aj  over  dQ  and  the  optimum  Q  for  x^  frcm  Table  1  is 
also  included  in  Table  3°  It  may  be  noted  that  in  most  situations  we  are 
forced  to  use  the  same  value  of  Q  for  both  x  and  d^.  If  the  current 
occasion  estimate  is  more  important,  we  may  prefer  to  use  the  current 
occasion  optimum  Q  for  both  d^  and  x^. 

Table  3  shows  that  considerable  gains  in  efficiency  of  d^  over  can 
be  obtained  using  the  optimum  Q  for  d^,  even  with  moderately  large  p.  Also 
the  optimum  Q  for  x^  leads  to  substantial  gains  in  efficiency  of  dj  over  dQ, 
though  the  gains  with  the  optimum  Q  for  d£  are  larger,  particularly  for 
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high  p.  Unlike  the  current  occasion  case  where  the  optimum  r  ia  2,  the  gain 
in  efficiency  of  d^  over  dQ  increases  with  r.  Therefore,  if  both  current 
occasion  and  change  are  of  interest,  we  may  prefer  to  use  an  r  other  than  2. 
The  variance  of  d^  for  moderate  and  large  values  of  m  differs  very  little 
from  the  variance  of  d^  with  m  =  oo.  The  optimum  Q  for  d^  is  unaffected  by 
the  value  of  m  in  most  of  the  cases, 

6.  CONCLUDING  REMARKS 

In  many  practical  situations,  the  exponential  correlation  pattern 
(model  l)  may  be  quite  reasonable.  Also,  a  comparison  with  the  arithmetic 
correlation  pattern  (model  2)  showed  that  the  exponential  correlation 
pattern  was  fairly  robust  to  moderate  deviations.  However,  in  a  monthly 
survey  with  characteristics  str* igly  influenced  by  seasonal  variations,  the 
correlation  between  occasions  12  monthB  apart  may  be  about  the  same  mag¬ 
nitude  (or  even  larger)  as  compared  to  the  correlation  between  consecutive 
occasions.  In  such  situations,  model  3, 

Sa,ol2j-i  “  S0,-12j-i  “  P1  p2  S0'  1  "  °’  1*  llj  J  "  °’  lj  2> 

where  Pj,  is  the  correlation  between  consecutive  occasions  and  p2  is  the 
correlation  between  occasions  12  months  apart,  may  be  more  appropriate. 

Then,  it  is  necessary  to  generalize  the  composite  estimators  x^  and  d^  and 
construct  composite  estimators  which  take  advar+age  of  both  p^  and  pg 
explicitly.  This  problem  ha.i  investigated  and  the  results  will  be 

published  in  a  later  paper,  ir.-.  generalized  composite  estimators  lead 
to  considerable  gsins  in  efficiency  over  and  d^,  when  p^  and  p 2  are  of 
approximately  the  same  magnitude. 
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In  large  scale  surveys  multistage  sampling  is  often  employed.  In  a 
two-stage  design,  rotation  sampling  is  canmonly  used  within  each  primary, 

and,  therefore,  the  composite  estimators  reduce  the  within  primary  varia¬ 
tion  only.  It  may  he  reasonable  to  assume  that  the  correlation  of 
secondaries  between  occasions  is  approximately  the  same  in  each  primary.  Then 
we  could  use  the  tables  for  the  optimum  Q  given  here  and  construct  com¬ 
posite  estimators,  with  the  same  Q,  within  each  selected  primary,  and  then 
determine  the  composite  estimator  of  the  over-all  population  mean.  For  the 
estimator  of  variance,  we  could  use  the  estimator  based  on  the  mean  square 
of  the  within  primary  composite  estimators  (this  estimator  of  variance  is 
unbiased  if  the  primaries  are  selected  with  replacement;  if  the  primaries 
are  selected  without  replacement  it  over-estimates  the  variance). 

We  wish  to  thank  Professor  H.  0.  Hartley  for  helpful  suggestions. 
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APPENDIX  B 


Variance  of 


The  variance  of  for  the  general  rotation  pattern  is 
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APPENDIX  C 


Variance  of  di 


The  variance  of for  the  general  rotation  pattern  is 
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